975 research outputs found

    Resolving transition metal chemical space: feature selection for machine learning and structure-property relationships

    Full text link
    Machine learning (ML) of quantum mechanical properties shows promise for accelerating chemical discovery. For transition metal chemistry where accurate calculations are computationally costly and available training data sets are small, the molecular representation becomes a critical ingredient in ML model predictive accuracy. We introduce a series of revised autocorrelation functions (RACs) that encode relationships between the heuristic atomic properties (e.g., size, connectivity, and electronegativity) on a molecular graph. We alter the starting point, scope, and nature of the quantities evaluated in standard ACs to make these RACs amenable to inorganic chemistry. On an organic molecule set, we first demonstrate superior standard AC performance to other presently-available topological descriptors for ML model training, with mean unsigned errors (MUEs) for atomization energies on set-aside test molecules as low as 6 kcal/mol. For inorganic chemistry, our RACs yield 1 kcal/mol ML MUEs on set-aside test molecules in spin-state splitting in comparison to 15-20x higher errors from feature sets that encode whole-molecule structural information. Systematic feature selection methods including univariate filtering, recursive feature elimination, and direct optimization (e.g., random forest and LASSO) are compared. Random-forest- or LASSO-selected subsets 4-5x smaller than RAC-155 produce sub- to 1-kcal/mol spin-splitting MUEs, with good transferability to metal-ligand bond length prediction (0.004-5 {\AA} MUE) and redox potential on a smaller data set (0.2-0.3 eV MUE). Evaluation of feature selection results across property sets reveals the relative importance of local, electronic descriptors (e.g., electronegativity, atomic number) in spin-splitting and distal, steric effects in redox potential and bond lengths.Comment: 43 double spaced pages, 11 figures, 4 table

    Traumatic injury and exposure to mitochondrial-derived damage associated molecular patterns suppresses neutrophil extracellular trap formation

    Get PDF
    Major traumatic injury induces significant remodeling of the circulating neutrophil pool and loss of bactericidal function. Although a well-described phenomenon, research to date has only analyzed blood samples acquired post-hospital admission, and the mechanisms that initiate compromised neutrophil function post-injury are therefore poorly understood. Here, we analyzed pre-hospital blood samples acquired from 62 adult trauma patients (mean age 44 years, range 19–95 years) within 1 h of injury (mean time to sample 39 min, range 13–59 min). We found an immediate impairment in neutrophil extracellular trap (NET) generation in response to phorbol 12-myristate 13-acetate (PMA) stimulation, which persisted into the acute post-injury phase (4–72 h). Reduced NET generation was accompanied by reduced reactive oxygen species production, impaired activation of mitogen-activated protein kinases, and a reduction in neutrophil glucose uptake and metabolism to lactate. Pre-treating neutrophils from healthy subjects with mitochondrial-derived damage-associated molecular patterns (mtDAMPs), whose circulating levels were significantly increased in our trauma patients, reduced NET generation. This mtDAMP-induced impairment in NET formation was associated with an N-formyl peptide mediated activation of AMP-activated protein kinase (AMPK), a negative regulator of aerobic glycolysis and NET formation. Indeed, activation of AMPK via treatment with the AMP-mimetic AICAR significantly reduced neutrophil lactate production in response to PMA stimulation, a phenomenon that we also observed for neutrophils pre-treated with mtDAMPs. Furthermore, the impairment in NET generation induced by mtDAMPs was partially ameliorated by pre-treating neutrophils with the AMPK inhibitor compound C. Taken together, our data demonstrate an immediate trauma-induced impairment in neutrophil anti-microbial function and identify mtDAMP release as a potential initiator of acute post-injury neutrophil dysfunction.</p

    Accelerating Chemical Discovery with Machine Learning: Simulated Evolution of Spin Crossover Complexes with an Artificial Neural Network

    Get PDF
    Machine learning (ML) has emerged as a powerful complement to simulation for materials discovery by reducing time for evaluation of energies and properties at accuracy competitive with first-principles methods. We use genetic algorithm (GA) optimization to discover unconventional spin-crossover complexes in combination with efficient scoring from an artificial neural network (ANN) that predicts spin-state splitting of inorganic complexes. We explore a compound space of over 5600 candidate materials derived from eight metal/oxidation state combinations and a 32-ligand pool. We introduce a strategy for error-aware ML-driven discovery by limiting how far the GA travels away from the nearest ANN training points while maximizing property (i.e., spin-splitting) fitness, leading to discovery of 80% of the leads from full chemical space enumeration. Over a 51-complex subset, average unsigned errors (4.5 kcal/mol) are close to the ANN's baseline 3 kcal/mol error. By obtaining leads from the trained ANN within seconds rather than days from a DFT-driven GA, this strategy demonstrates the power of ML for accelerating inorganic material discovery.United States. Office of Naval Research (Grant N00014-17-1-2956)United States. Department of Energy (Grant DE-SC0018096)National Science Foundation (U.S.) (Grant CBET-1704266

    Density functional theory for modelling large molecular adsorbate–surface interactions: a mini-review and worked example

    Get PDF
    First-principles simulation has played an ever-increasing role in the discovery and interpretation of the chemical properties of surface–adsorbate interactions. Nevertheless, key challenges remain for the computational chemist wishing to study surface chemistry: modelling the full extent of experimental conditions, managing computational cost, minimising human effort in simulation set-up and maximising accuracy. This article introduces new tools for streamlining surface chemistry simulation set-up and reviews some of the challenges in first-principles, density functional theory (DFT) simulation of surface phenomena. Furthermore, we provide a worked example of Co tetraphenylporphyrin on Au(1 1 1) in which we analyse electronic and energetic properties with semi-local DFT and compare to predictions made from hybrid functional and the so-called DFT+U correction. Through both review and the worked example, we aim to provide a pedagogical introduction to the challenges and the insight that first-principles simulation can provide in surface chemistry.National Science Foundation (U.S.) (ECCS-1449291

    Traumatic Injury and Exposure to Mitochondrial-Derived Damage Associated Molecular Patterns Suppresses Neutrophil Extracellular Trap Formation

    Get PDF
    Major traumatic injury induces significant remodeling of the circulating neutrophil pool and loss of bactericidal function. Although a well-described phenomenon, research to date has only analyzed blood samples acquired post-hospital admission, and the mechanisms that initiate compromised neutrophil function post-injury are therefore poorly understood. Here, we analyzed pre-hospital blood samples acquired from 62 adult trauma patients (mean age 44 years, range 19–95 years) within 1 h of injury (mean time to sample 39 min, range 13–59 min). We found an immediate impairment in neutrophil extracellular trap (NET) generation in response to phorbol 12-myristate 13-acetate (PMA) stimulation, which persisted into the acute post-injury phase (4–72 h). Reduced NET generation was accompanied by reduced reactive oxygen species production, impaired activation of mitogen-activated protein kinases, and a reduction in neutrophil glucose uptake and metabolism to lactate. Pre-treating neutrophils from healthy subjects with mitochondrial-derived damage-associated molecular patterns (mtDAMPs), whose circulating levels were significantly increased in our trauma patients, reduced NET generation. This mtDAMP-induced impairment in NET formation was associated with an N-formyl peptide mediated activation of AMP-activated protein kinase (AMPK), a negative regulator of aerobic glycolysis and NET formation. Indeed, activation of AMPK via treatment with the AMP-mimetic AICAR significantly reduced neutrophil lactate production in response to PMA stimulation, a phenomenon that we also observed for neutrophils pre-treated with mtDAMPs. Furthermore, the impairment in NET generation induced by mtDAMPs was partially ameliorated by pre-treating neutrophils with the AMPK inhibitor compound C. Taken together, our data demonstrate an immediate trauma-induced impairment in neutrophil anti-microbial function and identify mtDAMP release as a potential initiator of acute post-injury neutrophil dysfunction

    Link-INVENT: generative linker design with reinforcement learning

    Get PDF
    In this work, we present Link-INVENT as an extension to the existing de novo molecular design platform REINVENT. We provide illustrative examples on how Link-INVENT can be applied to fragment linking, scaffold hopping, and PROTAC design case studies where the desirable molecules should satisfy a combination of different criteria. With the help of reinforcement learning, the agent used by Link-INVENT learns to generate favourable linkers connecting molecular subunits that satisfy diverse objectives, facilitating practical application of the model for real-world drug discovery projects. We also introduce a range of linker-specific objectives in the Scoring Function of REINVENT. The code is freely available at https://github.com/MolecularAI/Reinvent

    Representations and Strategies for Transferable Machine Learning Models in Chemical Discovery

    Full text link
    Strategies for machine-learning(ML)-accelerated discovery that are general across materials composition spaces are essential, but demonstrations of ML have been primarily limited to narrow composition variations. By addressing the scarcity of data in promising regions of chemical space for challenging targets like open-shell transition-metal complexes, general representations and transferable ML models that leverage known relationships in existing data will accelerate discovery. Over a large set (ca. 1000) of isovalent transition-metal complexes, we quantify evident relationships for different properties (i.e., spin-splitting and ligand dissociation) between rows of the periodic table (i.e., 3d/4d metals and 2p/3p ligands). We demonstrate an extension to graph-based revised autocorrelation (RAC) representation (i.e., eRAC) that incorporates the effective nuclear charge alongside the nuclear charge heuristic that otherwise overestimates dissimilarity of isovalent complexes. To address the common challenge of discovery in a new space where data is limited, we introduce a transfer learning approach in which we seed models trained on a large amount of data from one row of the periodic table with a small number of data points from the additional row. We demonstrate the synergistic value of the eRACs alongside this transfer learning strategy to consistently improve model performance. Analysis of these models highlights how the approach succeeds by reordering the distances between complexes to be more consistent with the periodic table, a property we expect to be broadly useful for other materials domains

    On the Use of Automatically Generated Discourse-Level Information in a Concept-to-Speech Synthesis System

    Get PDF
    This paper describes the latest version of the SOLE concept-to-speech system, which uses linguistic information provided by a natural language generation system to improve the prosody of synthetic speech. We discuss the types of linguistic information that prove most useful and the implications for text-to-speech systems

    DockStream: a docking wrapper to enhance de novo molecular design

    Get PDF
    Recently, we have released the de novo design platform REINVENT in version 2.0. This improved and extended iteration supports far more features and scoring function components, which allows bespoke and tailor-made protocols to maximize impact in small molecule drug discovery projects. A major obstacle of generative models is producing active compounds, in which predictive (QSAR) models have been applied to enrich target activity. However, QSAR models are inherently limited by their applicability domains. To overcome these limitations, we introduce a structure-based scoring component for REINVENT. DockStream is a flexible, stand-alone molecular docking wrapper that provides access to a collection of ligand embedders and docking backends. Using the benchmarking and analysis workflow provided in DockStream, execution and subsequent analysis of a variety of docking configurations can be automated. Docking algorithms vary greatly in performance depending on the target and the benchmarking and analysis workflow provides a streamlined solution to identifying productive docking configurations. We show that an informative docking configuration can inform the REINVENT agent to optimize towards improving docking scores using public data. With docking activated, REINVENT is able to retain key interactions in the binding site, discard molecules which do not fit the binding cavity, harness unused (sub-)pockets, and improve overall performance in the scaffold-hopping scenario. The code is freely available at https://github.com/MolecularAI/DockStream
    • …
    corecore